A Pre-Identification Method for Chinese Named Entity Recognition
نویسندگان
چکیده
In this paper, a pre-identification method for Chinese named entity recognition is proposed. Internal information of entity name like family name, first name in person name, feature word in place name and organization name do not needed. Through entity name guessing based on context keywords, pre-identification is realized. Definition of bidirectional potential entity name recognition, rough confirmation of potential entity name, segmentation word is proposed. To solve the possible ambiguity in entity name identification, the degree of segmentation and conjunction is presented as well as cascade recognition and final confirmation. Combining with this pre-processing method, performance will be improved by using internal information of entity name. Experiment proves that the method have a special advantage in recognition special entity name, ambiguity name and irregular name. In this paper, Chinese person name is taken as an example for entity name recognition. Nevertheless, the method is not limit to person name recognition but also a preidentification method for other entity name.
منابع مشابه
Improvement of Chemical Named Entity Recognition through Sentence-based Random Under-sampling and Classifier Combination
Chemical Named Entity Recognition (NER) is the basic step for consequent information extraction tasks such as named entity resolution, drug-drug interaction discovery, extraction of the names of the molecules and their properties. Improvement in the performance of such systems may affects the quality of the subsequent tasks. Chemical text from which data for named entity recognition is extracte...
متن کاملNamed Entity Recognition in Persian Text using Deep Learning
Named entities recognition is a fundamental task in the field of natural language processing. It is also known as a subset of information extraction. The process of recognizing named entities aims at finding proper nouns in the text and classifying them into predetermined classes such as names of people, organizations, and places. In this paper, we propose a named entity recognizer which benefi...
متن کاملتشخیص اسامی اشخاص با استفاده از تزریق کلمههای نامزد اسم در میدانهای تصادفی شرطی برای زبان عربی
Named Entity Recognition and Extraction are very important tasks for discovering proper names including persons, locations, date, and time, inside electronic textual resources. Accurate named entity recognition system is an essential utility to resolve fundamental problems in question answering systems, summary extraction, information retrieval and extraction, machine translation, video interpr...
متن کاملA Joint Chinese Named Entity Recognition and Disambiguation System
In this paper we describe an integrated approach for named entity recognition and disambiguation in Chinese. The proposed method relies on named entity recognition (NER), entity linking and document clustering models. Different from other tasks of named entities, both classification and clustering are considered in our models. After segmentation, information extraction and indexing in the prepr...
متن کاملReal-time rich-content transcription of Chinese broadcast news
This paper describes the recent development of an Audio Indexing System for Chinese (Mandarin) broadcast news. Key issues of the three major components: automatic speech recognition, speaker identification and named entity extraction are addressed. The Chinese-language-specific challenges are discussed and our solutions are described. The recognition accuracy of the final system is comparable t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- JSW
دوره 5 شماره
صفحات -
تاریخ انتشار 2010